-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

Character Set

Technically, a character set is a group elements used to represent information. From a practical standpoint, character sets define the languages supported by applications and operating systems.

Currently, ISO-8859-1 is the preferred character set on the Internet. It contains all of the characters necessarily for supporting most European and Latin American languages. Other extensions of ISO-8859 support languages such as Arabic, Greek and Hebrew. All of these use 8 bits for representing data.

Recently, for wider portability, there has been a call for a standard that would support most of the world's languages, including Japanese and Chinese. This standard, Unicode (ISO 10646) is an extension to ISO 8859-1 using wide characters. Unicode is based on a 16-bit unit of encoding. Although Unicode offers significant advantages, it is controversial because few applications and operating systems support wide characters.

URLs:

W3E Resources:

international standards

-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-=-

E-Mail: The World Wide Web Encyclopedia at wwwe@tab.com
E-Mail: Charles River Media at chrivmedia@aol.com
Copyright 1996 Charles River Media. All rights reserved.
Text - Copyright © 1995, 1996 - James Michael Stewart & Ed Tittel.
Web Layout - Copyright © 1995, 1996 - LANWrights & IMPACT Online.
Revised -- February 20th, 1996